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ABSTRACT 


In this report, a mathematical model of a data 
communications system using retransmission error 
control is developed. System performance is spec¬ 
ified by information throughput, the amount of error- 
free data transferred per unit of time. Information 
throughput is determined as a function of message 
length, taking into account transmission delay, chan¬ 
nel error rate, and overhead words in the data 
message. 


REVIEW AND APPROVAL 

Publication of this technical report does not constitute Air Force approval 
of the report's findings or conclusions. It is published only for the exchange 
and stimulation of ideas. 



C.V.HORRIGAN 
Acting Director 
Aerospace Instrumentation 


iii 



TABLE OF CONTENTS 


Page 


LIST OF ILLUSTRATIONS vi 

LIST OF TABLES vi 

SECTION I INTRODUCTION 1 

SECTION E THE DATA COMMUNICATIONS SYSTEM 2 

SECTION HI ERROR CHARACTERISTICS OF THE CHANNEL 4 

SECTION IV THE MATHEMATICAL MODEL 5 

GENERAL 5 

AVERAGE LENGTH OF AN ERROR-FREE 
SEQUENCE 5 

AVERAGE NUMBER OF RETRANSMISSIONS 
WITH ERRORS 6 

TRANSMISSION OF N WORD MESSAGES 7 

INFORMATION THROUGHPUT 9 

SECTION V CONCLUSIONS 31 

REFERENCES 33 


-v- 



LIST OF ILLUSTRATIONS 


Figure Number Page 

1 Block Diagram of the Data Communications 

System 3 

2 Probability Error of N Word Messages P(N) 

Versus N and Word-Error Probability P 8 

3-18 Information Throughput T Versus Message 

Length N and Probability of Error P 12-27 


LIST OF TABLES 


Table Number 


Page 


I 

II 

m 

IV 

v 

VI 


Values of P, N, D, and I 
Solution of Equation (19) 

Optimum Message Length in 
to l-10" p Versus D and 

Optimum Message Length in 
to 1-10" 3 Versus D and 

Optimum Message Length in 
to 1-10" 4 Versus D and 

Optimum Message Length in 
to 1-1(T 5 Versus D and 

Optimum Message Length in 
to 1-10 _B Versus D and 


used in Computer 





11 

Words for 

I 

P 

Equal 

28 

Words for 

I 

P 

Equal 

28 

Words for 
I 

P 

Equal 

29 

Words for 

I 

P 

Equal 

29 

Words for 

I 

P 

Equal 

30 


-vi- 






SECTION I 


INTRODUCTION 


In digital data communications, error detection and retransmission is 
one method of obtaining essentially error-free transfer of data over a bursty 
channel between a data source and a remotely located data sink. A perform¬ 
ance measure of a data communications system is information throughput , 
the amount of error-free data transferred per unit of time. In this report, a 
previously developed mathematical model^^ is extended to permit determina¬ 
tion of information throughput as a function of message length, taking into 

account transmission delay, channel error rate, and overhead words in the 

[ 2 ] 

data message. Kuhn , who previously examined this problem, did not 
include in his analysis the effects of transmission delay or overhead words. 

The data communications system, the mathematical model of the system 
for which the throughput is calculated, the error characteristics of the channel, 
and the graphical solutions to the throughput equation, which permits determina¬ 
tion of optimum message length for maximum throughput under changing 
operating parameters of channel error rate, delay, and message overhead, 
are described in this report. 
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SECTION n 


THE DATA COMMUNICATIONS SYSTEM 


A functional block diagram of a typical data communications system 
is shown in Figure 1. A data source sends a sequence of digital information 
via a noisy channel to a remotely located sink. The data are encoded into 
information blocks called words. The data words are assembled into message 
blocks containing N words. A data word is considered in error if one or 
more errors appear in the received data word. A message is considered in 
error if one or more of the N words in the message are received in error. 

The sink is assumed to have the capability always to detect errors in 
the received data. This is a reasonable assumption since very powerful 
error-detecting codes are available. If a message in error is detected, the 
sink requests the source to go back in the data sequence to the message 
received in error and recommence transmission. The sink then ignores all 
incoming messages until the retransmitted data message is received cor¬ 
rectly. 

The time interval between the detection of a message in error and 
reception of the retransmission is the sum of the detection time and the 
round-trip transmission time of the communications system. The error- 
detection time is the time it takes for the sink to receive the message or 
N word intervals. The round-trip transmission time of the communica¬ 
tions system is D word intervals. 

Of the N words in each message, I words are overhead, i.e., they 
carry no information to the data sink. These I words are used to identify 
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NOISE 



Figure 1. Block Diagram of the Data Communications System 


the message, provide error detection, and maintain system synchronization. 
The remaining N-I words of the message contain the useful information 
transferred over the communications system. 
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SECTION m 


ERROR CHARACTERISTICS OF THE CHANNEL 


Error measurements on various communication media, such as high- 
frequency radio and troposcatter, indicate that these communications 
channels are bursty, i.e., the error occurrences are highly correlated. 


Attempts have been made to model these channels and to list their relevant 
statistical characterise 
channel error patterns. 


[3 4] 

statistical characteristics ’ in order to describe the behavior of the 


A quantity commonly used to describe the channel is average word- 
error rate, P. This number is found by averaging the number of words in 
error in a fixed interval of the data stream over the total number of words 
in the interval. For example, if 10 words were in error in an interval of 10 4 
words, the average word-error rate would be 10 divided by 10 4 or 10 -3 . 

For a channel with random, uncorrelated error patterns, average word- 
error rate is a good measure of transmission quality. For bursty channels, 
however, average word-error rate is not a meaningful measure unless 
additional higher order probabilities describing the random channel behavior 
are given. 

In this report, however, average word-error rate alone is used to 
illustrate the method. Higher order probability terms would vary the 
calculated results, but not the method employed. 
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SECTION IV 


THE MATHEMATICAL MODEL 


GENERAL 

The data transmission system attempts to send messages over the 
channel. Errors occur in the messages at an average rate, P, and cause 
retransmissions which interrupt the flow of information. 

Expressions for the average length of a sequence of error-free data 
messages, the average number of retransmissions necessary to receive an 
error-free message, and the information throughput of the communications 
system are derived below. 

AVERAGE LENGTH OF AN ERROR-FREE SEQUENCE 

Let P equal 1-P, where P is the average word-error rate. From 
the theory of combinatorial analysis, it can be shown that the probability of a 
sequence of n words being transmitted without any errors, L(n), is given by 


L(n) = P P 


( 1 ) 


The average sequence length, n, is given by 


00 



n L(n) . 


( 2 ) 


n= 0 


Substituting (1) into (2) gives 


00 



(3) 
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Equation (3) reduces to 


P P P 


(1-P) 2 P 


(4) 


Therefore, the average length of the errorless sequence is P/P words. 

AVERAGE NUMBER OF RETRANSMISSIONS WITH ERRORS 

From the mathematics of combinatorial analysis, it can be shown that 
the probability of x retransmissions to receive an error-free retrans¬ 
mission K(x) is given by 


x 


K(x) = P P 


(5) 


The average number of retransmissions to receive an error-free retrans¬ 
mission is given by 


CO 



( 6 ) 


x=0 


Substituting (5) into (6) yields 


00 



(?) 


x=0 


Equation (7) is similar to Equation (3) and reduces to 


P 


( 8 ) 


x 


P 


From Equation (4), therefore, 


1 


(9) 


x 


n 
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TRANSMISSION OF N WORD MESSAGES 


When data are transmitted in groups of N words, all N words 
must be received without errors or a retransmission is requested. The 
probability of successful transmission P(N) is given by 


- -n 
P(N) = P 


(10) 


The probability that the N word block contains at least one error, P(N), 
is given by 


—N 


(ID 


P(N) = 1-P 


This relationship is plotted in Figure 2, which shows P(N) as a function of 
N and P or 1-P. 

The average number of blocks transmitted without errors is represented 
by n^, and, in order to receive a message which is retransmitted without 
errors, the average number of retransmissions of N word blocks is rep¬ 
resented by . The transmission with errors which initiated the retrans¬ 
mission sequence is not included in x^. Substituting Equations (9) and (10) 
into Equations (4) and (8) yields 


n. 


N 


—N 


( 12 ) 


1-P 


and 



(13) 
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Figure 2. Probability Error of N Word Messages P(N) Versus N and 
Word-Error Probability P 
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INFORMATION THROUGHPUT 


Transmission of an error-free message results in N data words 
being accepted by the sink. An error-free sequence of n messages results 
in N n data words being accepted by the sink. A retransmission caused by 
the occurrence of an error and the termination of a sequence of errorless 
messages results in a transmission interruption of (D+ N) words, which is 
the sum of the round-trip transmission time and the error-detection time. 
The average interruption duration is (D + N) times the average number of 
times (1 + x) that a message must be transmitted and retransmitted in order 
to be received without errors. 


Information throughput, T , is defined as the amount of error-free 
information transferred per unit time. This quantity is the ratio of the 
average information transferred between retransmissions to the informa¬ 
tion which would be passed if there were no retransmissions. Thus, 


T 


N 


Nn 

Nn + (D+N) (1+x) 


(14) 


Substituting Equations (11) and (12) into Equation (13) yields 

N P N /1 - P N 

t n = 


N ( P N / 1-P N ) + ^ D +Nj[l + ( 1-P N /P N ) ] 


(15) 


Equation (14) may be rewritten as 


T = 
N 


-2N 
N P 


N fp 2N ) + (d+n) (l-P N ) 


(16) 
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Since each N word message contains I overhead words, which 
results in a relative fraction, R, of noninformation-carrying words per 
message where 


R 


N ’ 


(17) 


then the actual information throughput, T, is given by 


T = T n (1-R) . 

Substituting Equations (15) and (16) into Equation (17) yields 

N P 2N ( 1 - Vn ) 

N P 2N +(d + n) ( 1 - P N ) 


(18) 


(19) 


Equation (18) has been calculated by a computer using the values of P, 

N, D, and I shown in Table I. Figures 3 through 18 are plots of Equation 
(18). Each plot shows information throughput, T, versus message length, 
N, for various values of P and constant values of D and I. 


By inspecting Figures 3 through 18 it is obvious that for given 
values of P, D, and I there is a unique message length that maximizes 
the information throughput. Message lengths greater or less than the 
optimum will result in decreased throughput. Tables II through VI show 
the optimum message length to yield maximum throughput versus P 
for various values of D and I. 
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Table I 


Values of P, N, D, and I 
Used in Computer Solution of Equation (19) 
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Figure 3. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 4. Information Throughput T Versus Message Length 
of Error P 


N and Probability 
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Figure 5, Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 6. 


Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 7. Information Throughput T Versus Message Length 
of Error P 


N and Probability 
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Figure 8. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 9. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 10. Information Throughput T Versus Message Length 
of Error P 


N and Probability 
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Figure 11. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 12, Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 13. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 14. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 15. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 16. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 17. Information Throughput T Versus Message Length N and Probability 
of Error P 
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Figure 18. Information Throughput T Versus Message Length N and Probability 
of Error P 


- 27 - 









Table II 


Optimum Message Length in Words for 
F Equal to 1-1CT 2 Versus D and I 


\l 

D \ 

1 

2 

4 

8 

10 

8 

16 

16 

32 

40 

8 

16 

16 

32 

160 

8 

16 

16 

32 

640 

8 

16 

16 

32 


Table IH 


Optimum Message Length in Words for 
P Equal to 1-10"^ Versus D and I 


\ I 

D \ 

1 

2 

4 

8 

10 

32 

32 

64 

64 

40 

32 

32 

64 

64 

160 

32 

32 

64 

64 

640 

32 

32 

64 

64 
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Table IV 


Optimum Message Length in Words for 
P Equal to 1-10 -4 Versus D and I 


\I 

D \. 

1 

2 

4 

8 

10 

128 

128 

256 

256 

40 

128 

128 

256 

256 

160 

128 

128 

256 

256 

640 

128 

128 

256 

256 


Table V 

Optimum Message Length in Words for 
P Equal to 1-10“ 5 Versus D and I 


\I 

D 

1 

2 

4 

8 

10 

256 

512 

512 

1024 

40 

256 

512 

512 

1024 

160 

256 

512 

512 

1024 

640 

256 

512 

512 

1024 
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Table VI 


Optimum Message Length in Words for 
P Equal to 1-10“ Versus D and I 


\I 

d\ 

1 

2 

4 

8 

10 

1024 

1024 

2048 

2048 

40 

1024 

1024 

2048 

2048 

160 

1024 

1024 

2048 

2048 

640 

1024 

1024 

2048 

2048 
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SECTION V 


CONCLUSIONS 

In this report, a mathematical model of a data communications 
system using retransmission error control was developed. System per¬ 
formance is specified by information throughput, the amount of error-free 
data transferred per unit of time. Information throughput, T, is determined 
as a function of message length, N, taking into account transmission delay, 

D, channel error rate, P, and overhead words in the data message, I. The 
throughput expression given in Equation (19) was plotted in Figures 3 through 
18 for various values of N, D, P, and I. 

It is obvious from inspecting the curves that throughput is dependent on 
error rate and message length. If the error rate is 10“ or less, then mes¬ 
sage length is very close to optimum over a broad range of values. This 
optimum is also greater than 0.95. Lower rates than 10 -5 would bring the 
optimum closer to 1.0. For example, a nominal error rate of wire line might 
be 10 -5 . To attain a lower error rate would require forward acting error- 
correction codes which would add a fixed percent of redundancy, C, to each 
data word. Typical values of C range from 0.33 to 0.5. Since the attainable 
increase in throughput is only 0.05 or less, the use of these codes, however, 
would actually decrease the throughput. 

Typical optimum throughputs at an error rate of 10 -r , however, range 
from 0.75 to 0.05. If error-correction coding can decrease the error rate 
to 10 -5 or better, a net gain in information throughput can result. 
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A study of the advantages of error-correction coding is just one 
example of the usefulness of the model presented in this report. Other uses 
will depend on the problems of the communications system designer. 
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